A guide to numeric measures

نویسنده

  • Andrei Popescu-Belis
چکیده

Computer programs are increasingly capable of grouping together expressions of a discourse that denote the same entity. We regard the evaluation of this capacity as the comparison between a system’s response and the one expected by the evaluators. We outline a theoretical framework for reference (§ 1) and another one for evaluation (§ 2), then analyze three existing quality measures (§ 3.2– 3.4), one of which was used in the MUC evaluation campaign. We propose mainly two new measures, one based on the notion of core equivalence class (§ 3.5), and the other based on information theory (§ 3.6), both showing better theoretical coherence than the previous ones. We also examine two alternatives, the exclusive core classes (§ 3.5.4) and the distributional measure (§ 3.7). In addition, we study a series of generalizations to the main problem (§ 4), and provide the results of all measures on several texts ( 5). Language-related tasks often require a certain degree of language understanding. Following a commonsense conception, understanding a linguistic message will be equated here with: (1) understanding what entities the message talks about; (2) understanding what the message says about these entities (their properties, relations, etc.) Our main goal is to estimate the capacity of a computer program to “understand references”, that is, to keep track of the various entities that a linguistic message is about. We will first describe a broad framework for this phenomenon, and situate the problem of coreference within it. 1. A FRAMEWORK FOR REFERENCE USE

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Correlation of Single Assessment Numeric Evaluation (SANE) with other Patient Reported Outcome Measures (PROMs)

Background: The Single Assessment Numeric Evaluation (SANE) is a simple, one-question patient-reported outcomemeasure (PROM). We systematically reviewed correlations between SANE and more extensive PROMs.Methods: We identified studies with correlation coefficients between SANE and other shoulder, knee, and anklespecificPROMs. We calculated mean, median and range across studies...

متن کامل

Numeric Multi-Objective Rule Mining Using Simulated Annealing Algorithm

Abstract as a single objective one. Measures like support, confidence and other interestingness criteria which are used for evaluating a rule, can be thought of as different objectives of association rule mining problem. Support count is the number of records, which satisfies all the conditions that exist in the rule. This objective represents the accuracy of the rules extracted from the da...

متن کامل

Numeric Planning via Abstraction and Policy Guided Search

The real-world application of planning techniques often requires models with numeric fluents. However, these fluents are not directly supported by most planners and heuristics. We describe a family of planning algorithms that takes a numeric planning problem and produces an abstracted representation that can be solved using any classical planner. The resulting abstract plan is generalized into ...

متن کامل

Measuring Psychological Uncertainty: Verbal Versus Numeric Methods

The authors argue that alternatives to the traditional numeric methods of measuring people's uncertainty may prove to hold important advantages under some conditions. In 3 experiments, the authors compared verbal measures involving responses such as very likely, and numeric measures involving responses such as 80% chance. The verbal measures were found to show more sensitivity to various manipu...

متن کامل

A Hybrid Relaxed Planning Graph'LP Heuristic for Numeric Planning Domains

Effective search control for numeric planning domains, in which appropriate numeric resource usage is critical to solving the problem, remains an open challenge in domainindependent planning. Most real-world problems rely on metric resources such as energy, money, fuel or materials. Despite the importance of numbers, few heuristics have been proposed to guide search in such domains. Hoffmann’s ...

متن کامل

ارائه یک الگوریتم خوشه بندی برای داده های دسته ای با ترکیب معیارها

Clustering is one of the main techniques in data mining. Clustering is a process that classifies data set into groups. In clustering, the data in a cluster are the closest to each other and the data in two different clusters have the most difference. Clustering algorithms are divided into two categories according to the type of data: Clustering algorithms for numerical data and clustering algor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007